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Today's Date: n . g^*> 



What date would you like to use to limit the search? 
Priority Date: Other 



Name Vju^iC : 

AU Examiner # ^Zfr vV 

Room # Phone - 3T<^- 
Serial # ^ 



Format fo r Search Results (Circle One): 

C paper^ J) disk email 

Where have you searched so far? 
JJSg? DWPI EPO JPO ACM IBM TDB 
EEE^ INSPEC SPI Other 



Is this a "Fast & Focused" Search Request? (Circle One) (JEjF) NO 

A "Fast & Focused" Search is completed in 2-3 hours (maximum). The search must be on a very specific topic and 
meet certain criteria. The criteria are posted in EIC2100 and on the EIC2100 NPL Web Page at 
http://ptoweb/patents/stic/stic-tc21 00. htm. 



What is the topic novelty, motivation, utility, or other specific details defining the desired focus of this search? Please 
include the concepts, synonyms, keywords, acronymSj definitions, strategies, and anything else that helps to describe 
the topic. Please attach a copy of the abstract, background, brief summary, pertinent claims and any citations of 
relevant art you have found. ; 
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Set Items Description 

51 2272566 PARS? OR TOKENI? OR MAP OR MAPPING OR MAPPED OR SEGREGAT? - 

OR (FILTER OR PULL) ( ) OUT OR EXTRACT? 

52 46910 IDENTIFIER? OR ID(N) (NUMBER OR TAG) OR PATENT () NUMBER? OR - 

UPC OR PRODUCT (N) (NUMBER? OR CODE?) OR UPN OR URN OR DOI 

53 4066679 CUT (N) PASTE? OR PASTING OR SELECT? OR HIGHLIGHT? OR DROP? - 

OR DRAG (N) DROP? 

54 7103946 SELECT? OR CHOOS? OR SEARCH? OR QUER? OR SEEK? OR FIND? OR 

RETRIEV? OR MATCH? 

55 14625136 DOCUMENT? OR TEXT? OR PAGE? OR PUBLICATION? OR PAPER? OR I - 

NFORMATION? OR DATA OR PATENT? () APPLICATION? 

56 13 9 SI AND S2 AND S3 AND S4 AND S5 

57 148 S1(3N)S2 

58 20 S6 AND S7 

59 148141 S1(3N)S5 

510 0 Sll AND S12 

511 21956 S3(3N) (QUERY OR QUERIES OR REQUEST OR QUESTION? OR REQUESTS 

OR TEXT? OR INQUIR?) 

512 5781 S5(N) (NUMBER? OR ID OR IDENTIFIER? OR IDS) 

513 139 S6(N)S2 

514 2434941 MATCH? OR RECOGNI? OR IDENTIFY OR IDENTIFIES 

515 1230 S5(2N)S2 

516 32 S6 AND S9 

517 2 S6 AND Sll 

518 61 S14(5N)S12 

519 5 S14(2N) (PATENTO (NO OR NUMBER OR ID OR IDENTIFIER?)) 

520 52 S8 OR S16 OR S17 OR S19 

521 38 RD (unique items) 

522 27 S21 NOT PY>2000 

523 12 S18 AND SI 

524 7 S18 AND S3 

525 44 S23 OR S24 OR S22 

526 39 RD (unique items) 

527 36 S26 NOT PY>2000 

528 22 0 (TEXT OR PATENT OR TELEPHONE OR DOCUMENT) (N) (NUMBER OR IDE- 

NTIFIER? OR ID) (5N) (TEXT OR FREETEXT OR FULLTEXT) (3N) (SEARCH? 
OR QUER? OR RETRIEV?) 

529 32 S28 AND SI 

530 39 S28 AND S3 

531 63 S29 OR S30 

532 46 RD (unique items) 

533 32 S32 NOT PY>2000 

534 32 S33 NOT S27 

File 8:Ei Compendex(R) 1970-2005/May W5 

(c) 2005 Elsevier Eng. Info. Inc. 
File 35 :Dissertation Abs Online 1861-2005/May 

(c) 2005 ProQuest Inf o&Learning 
File 65: Inside Conferences 1993 -2005/Jun Wl 

(c) 2005 BLDSC all rts. reserv. 
File 2:INSPEC 1969 -2005/May W5 

(c) 2005 Institution of Electrical Engineers 
File 94 : JICST-EPlus 198 5 -2 005/Apr W3 

(c)2005 Japan Science and Tech Corp(JST) 
File 111 : TGG Nat 1 . Newspaper Index(SM) 1979-2005/Jun 03 

(c) 2005 The Gale Group 
File 6:NTIS 1964 -2005/May W5 

(c) 2005 NTIS, Intl Cpyrght All Rights Res 
File 144:Pascal 1973 -2005/May W5 

(c) 2005 INIST/CNRS 
File 434 : Sci Search (R) Cited Ref Sci 1974 - 1989/Dec 

(c) 1998 Inst for Sci Info 
File 34 :SciSearch(R) Cited Ref Sci 1990-2005/May W5 

(c) 2005 Inst for Sci Info 
File 99:Wilson Appl . Sci & Tech Abs 1983 -2005/May 

(c) 2005 The HW Wilson Co. 



File 95 :TEME-Technology & Management 1989-2005/Apr W4 
(c) 2005 FIZ TECHNIK 



34/5/3 (Item 3 from file: 8) 

DIALOG (R) File 8:Ei Compendex (R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts . reserv. 

04452578 E.I. No: EI P96 043 156751 

Title: Intelligent retrieval of medical images from the Internet 

Author: Tang, Yau-Kuo; Chiang, Ted T. 

Corporate Source: Loral AeroSys, Seabrook, MD, USA 

Conference Title: Medical Imaging 1996: PACS Design and Evaluation: 
Engineering and Clinical Issues 

Conference Location: Newport Beach, CA, USA Conference Date: 
19960213-19960215 

Sponsor: SPIE - Int Soc for Opt Engineering, Bellingham, WA USA 

E.I. Conference No.: 22519 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 2711 1996. Society of Photo-Optical Instrumentation 
Engineers, Bellingham, WA, USA. p 440-448 

Publication Year: 1996 

CODEN: PSISDG ISSN: 0277-786X ISBN: 0-8194-2086-7 
Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications) 
Journal Announcement: 9609W3 

Abstract: The object of this study is using Internet resources to provide 
a cost-effective, user- friendly method to access the medical image archive 
system and to provide an easy method for the user to identify the images 
required. This paper describes the prototype system architecture, the 
implementation, and results. In the study, we prototype the Intelligent 
Medical Image Retrieval (IMIR) system as a Hypertext Transport Prototype 
server and provide Hypertext Markup Language forms for user, as an Internet 
client, using browser to enter image retrieval criteria for review. We are 
developing the intelligent retrieval engine, with the capability to map 
the free text search criteria to the standard terminology used for medical 
image identification. We evaluate retrieved records based on the number 
of the free text entries matched and their relevance level to the 
standard terminology. We are in the integration and testing phase. We have 
collected only a few different types of images for testing and have trained 
a few phrases to map the free text to the standard medical terminology. 
Nevertheless, we are able to demonstrate the IMIR's ability to search, 
retrieve, and review medical images from the archives using general 
Internet browser. The prototype also uncovered potential problems in 
performance, security, and accuracy. Additional studies and enhancements 
will make the system clinically operational. 8 Ref s . 

Descriptors: *Medical imaging; Information retrieval systems; Artificial 
intelligence; Data transfer; Computer networks; Communication systems 

Identifiers: Intelligent retrieval engines; Internet; Hypertext transport 
; Medical terminology 

Classification Codes: 



34/5/7 (Item 7 from file: 8) 

DIALOG (R) File 8 : Ei Compendex (R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts. reserv. 

02237811 E.I. Monthly No: EIM8703 - 020941 

Title: ON THE USE OF KNOWLEDGE -BASED PROCESSING IN AUTOMATIC TEXT 
RETRIEVAL . 

Author: Salton, Gerard 

Corporate Source: Cornell Univ, Ithaca, NY, USA 

Conference Title: ASIS '86, Proceedings of the 49th ASIS Annual Meeting. 
Conference Location: Chicago, IL, USA Conference Date: 19860928 
Sponsor: ASIS, Washington, DC, USA 
E.I. Conference No.: 09182 

Source: Proceedings of the ASIS Annual Meeting v 23 1986. Publ by Learned 
Information Inc, Medford, NJ, USA p 277-287 
Publication Year: 1986 

CODEN: PAISDQ ISSN: 0044-7870 ISBN: 0-938734-14-8 
Language: English 

Document Type: PA; (Conference Paper) 
Journal Announcement: 8703 

Abstract: The representation of document and information content by means 
of single terms extracted from document texts is not always adequate for 
text processing purposes. Term relations or associations are therefore 
often used for the construction of complex identifiers to be attached to 
the information items. The complex identifications include term phrases 
generated by using the occurrence characteristics of certain words in 
document texts, and synonym classes specified in a thesaurus. The 
experimental evidence indicates that substantial difficulties arise in 
obtaining effective complex text identifiers that actually help in 
retrieval . To replace the existing text analysis methods, artificial 
intelligence approaches are often proposed based on the use of stored 
knowledge bases and expert system approaches. The main components of 
advanced artificial intelligence systems are briefly examined, and the 
conclusion is reached that the artificial intelligence methods are likely 
to be even more difficult to apply to normal document environments than the 
conventional text analysis methodologies. (Author abstract) 38 refs. 

Descriptors: * INFORMATION RETRIEVAL SYSTEMS; ARTIFICIAL INTELLIGENCE- - 
Expert Systems; INFORMATION SCIENCE- - Indexing 

Identifiers: AUTOMATIC TEXT RETRIEVAL ; KNOWLEDGE -BASED SYSTEMS; CONTENT 
ANALYSIS • 

Classification Codes: 

723 (Computer Software) ; 903 (Information Science) 

72 (COMPUTERS & DATA PROCESSING) ; 90 (GENERAL ENGINEERING) 



27/5/1 (Item 1 from file: 8) 

DIALOG (R) File 8 : Ei Compendex(R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts . reserv. 

05121104 E.I. No: EI P98094386323 

Title: Combining laboratory data sets from multiple institutions using 
the logical observation identifier names and codes (LOINC) 

Author: Baorto, David M. ; Cimino, James J.; Parvin, Curtis A.; Kahn, 
Michael G. 

Corporate Source: Washington Univ, St. Louis, MO, USA 

Source: International Journal of Medical Informatics v 51 n 1 Jul 1998. p 
29-37 

Publication Year: 1998 

CODEN: IJMIF4 ISSN: 1386-5056 

Language: English 

Document Type: JA; (Journal Article) Treatment: A; (Applications) 
Journal Announcement: 9811W3 

Abstract: A standard set of names and codes for laboratory test results 
is critical for any endeavor requiring automated data pooling, including 
multi -institutional research and cross-facility patient care. This need had 
led to the development of the logical observation identifier names and 
codes (LOINC) database and its test-naming convention. This study is an 
expansion of a pilot study using LOINC to exchange laboratory data 
between Columbia University Medical Center in New York and Barnes Hospital 
at Washington University in St. Louis, where we described complexities and 
ambiguities that arose in the LOINC coding process (D.M. Baorto, J.J. 
Cimino, C.A. Parvin, M.G. Kahn, Proc . Am. Med. Inf. Assoc. 1997). For the 
present study, we required the same two medical centers to again extract 
raw laboratory data from their local information system for a defined 
patient population, translate tests into LOINC and provide aggregate data 
which could then be used to compare laboratory utilization. Here we examine 
a larger number of tests from each site which have been recorded using an 
updated version of the LOINC database. We conclude that the coding of local 
tests into LOINC can often be complex, especially the 'Kind of Property* 
field and apparently trivial differences in choices made by individual 
institutions can result in nonmatches in electronically pooled data . In 
the present study, 75% of failures to match the same tests between 
different institutions using LOINC codes were due to differences in local 
coding choices. LOINC has the potential to eliminate the need for detailed 
human inspection during the pooling of laboratory data from diverse sites, 
and perhaps even a built-in capability to adjust matching stringency by 
selecting subsets of LOINC fields required to match . However, a quality 
standard coding procedure is required and examples highlighted in this 
paper may require special attention while mapping to LOINC. (Author 
abstract) 12 Ref s . 

Descriptors: *Medical computing; Data structures; Database systems; 
Codes (symbols); Hospitals; Societies and institutions; Hospital data 
processing; Health care 

Identifiers: Laboratory data sets; Multiple institutions; Logical 
observation identifier names and codes 

Classification Codes: 

901.1.1 (Societies & Institutions) 

723.5 (Computer Applications) ; 461.1 (Biomedical Engineering); 723.2 
(Data Processing); 723.3 (Database Systems); 462.2 (Hospitals, Equipment 
& Supplies); 901.1 (Engineering Professional Aspects) 

723 (Computer. Software) ; 461 (Biotechnology) ; 462 (Medical Engineering 
& Equipment) ; 901 (Engineering Profession) 

72 (COMPUTERS & DATA PROCESSING) ; 46 (BIOENGINEERING) ; 90 (GENERAL 
ENGINEERING) 



27/5/3 (Item 3 from file: 8) 

DIALOG (R) File 8 : Ei Compendex(R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts . reserv. 



04955677 E.I. No: EIP98024 09203 5 

Title: Classifying and retrieving software components based on profiles 

Author: Hong, S.B.; Kim, Kapsu 

Corporate Source:' Electronics and Telecommunications Research Inst, 
Taejon, South Korea 

Conference Title: Proceedings of the 1997 1st International Conference on 
Information, Communications and Signal Processing, ICICS. Part 3 (of 3) 

Conference Location: Singapore, Singapore Conference Date: 

19970909-19970912 

Sponsor: IEEE 

E.I. Conference No.: 48010 

Source: Trends in Information Systems Engineering and Wireless Multimedia 
Communications Proceedings of the International Conference on Information, 
Communications and Signal Processing, ICICS v 3 1997. IEEE, Piscataway, NJ, 
USA. p 1756-1760 

Publication Year: 1997 

CODEN: 002795 

Language: English 

Document Type: CA; (Conference Article) Treatment: G; (General Review) 
Journal Announcement: 98 04W4 

Abstract: We propose that the Software Reuse System can classify, 
register, and retrieve software components based on their profiles. There 
are two profiles : Object Profiles are constructed by extracting from 
software components their identifiers , function identifiers , and 
variable identifiers . Virtual Profiles are made by extracting common 
identifiers and their weights from Object Profiles or Virtual Profiles. By 
similarity function, the similarity values between profiles and software 
components are calculated, classified, registered by their value. To 
retrieve software components, keywords representing the software 
components and their weights are inputted by users. The similarity value of 
keywords and profiles is calculated, and software components with most high 
similarity value are retrieved . This system can register and retrieve 
software components more easily than other system and classify and 
retrieve software components faster than the systems using conventional 
information retrieval method. (Author abstract) 8 Ref s . 

Descriptors: *Computer software selection and evaluation; Computer 
aided software engineering; Information retrieval systems; Database 
systems 

Identifiers: Software reuse systems; Virtual profiles; Object profiles 
Classification Codes: 

723.1 (Computer Programming); 723.5 (Computer Applications) ; 903.3 
(Information Retrieval & Use); 723.3 (Database Systems) 
723 (Computer Software) ; 903 (Information Science) 
72 (COMPUTERS & DATA PROCESSING) ; 90 (GENERAL ENGINEERING) 



27/5/4 (Item 4 from file: 8) 

DIALOG (R) File 8 : Ei Compendex(R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts. reserv. 

04675987 E.I. No: EIP97043 627828 

Title: Selection of passages for information reduction 
Author: Daniels, Jody J. 

Corporate Source: Univ of Massachusetts, Amherst, MA, USA 

Conference Title: Proceedings of the 1996 13th National Conference on 
Artificial Intelligence. Part 2 (of 2) 

Conference Location: Portland, OR, USA Conference Date: 

19960804-19960808 

Sponsor: AAAI 

E.I. Conference No.: 46255 

Source: Proceedings of the National Conference on Artificial Intelligence 
v 2 1996. AAAI, Menlo Park, CA, USA. p 1360 
Publication Year: 1996 
CODEN: PNAIEE 
Language: English 

Document Type: CA; (Conference Article) Treatment: G; (General Review) 
Journal Announcement: 9706W2 

Abstract: Selection of Passages for Information REduction (SPIRE) 
integrates a case based reasoning with an information retrieval (IR) 
engine for automated information extraction . SPIRE works by focusing on 
portions of a text most likely to contain the desired informations . 
This case-based reasoning (CBR) system generates an IR query by passing 
the identifiers of the documents , describing fact situations most 
similar to the current problem, to an IR engine. Using these annotations, 
the IR component generates a new query aimed at retrieving small 
relevant passages from the documents . The location and display of these 
important passages reduces reading and results in a tremendous savings in 
time and effort. 1 Ref s . 

Descriptors: *Informatio n retrieval systems; Information technology 
; Inference engines; Query languages; Knowledge based systems; Knowledge 
representation 

Identifiers: Selection of passages for information reduction (SPIRE); 
Case based reasoning (CBR) systems; Automated information extraction 

Classification Codes : 
723.4.1 (Expert Systems) 

903.3 (Information Retrieval & Use); 723.5 (Computer Application's); 
723.4 (Artificial Intelligence); 723.3 (Database Systems) 
903 (Information Science) ; 723 (Computer Software) 
90 (GENERAL ENGINEERING) ; 72 (COMPUTERS & DATA PROCESSING) 



27/5/16 (Item 9 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2005 Institution of Electrical Engineers. All rts. reserv. 

00446867 INSPEC Abstract Number: C72023814 
Title: Data processor 
Assignee (s): RCA Corp 

Patent Number: GB 1280772 Issue Date: 720705 
Application Date: 700121 

Priority Appl . Number: US 793043 Priority Appl . Date: 690122 
Country of Publication: UK 

Language: English Document Type: Patent (PT) 
Treatment: Practical (P) 

Abstract: The processor includes a buffer memory each addressable 
location of which stores data and an identifier segment. The memory 
address generator provides a locator segment identifying a memory location 
and a tag segment, the address selector responds to the address locator 
segment only to extract the data and identifier segment from the 
corresponding memory location, and this identifier segment is compared 
with the address tag segment to enable gates transferring the data to a 
computer when equality is found. 

Subfile: C 

Descriptors: data handling; digital computers; digital storage 
Identifiers: data processor; buffer memory; addressable location; 
identifier segment; memory address generator; locator segment; tag segment 
; address selector ; identifier segment; address ta'g segment; gates; 
equality 

Class Codes: C6130 (Data handling techniques) 



27/5/17 (Item 10 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2005 Institution of Electrical Engineers. All rts. reserv . 

00113897 INSPEC Abstract Number: C70005997 
Title: Keyword- in -context index for technical literature (KWIC index) 

Author (s ) : Luhn, H.P. 

Book Title: Pioneer of information science, selected works p. 
227-35 

Editor(s): Schultz, C.K. 
Publisher: Macmillan, London, UK 

Publication Date: 1969 Country of Publication: UK 3+320 pp. 
Language: English Document Type: Book Chapter (BC) 

Abstract: A distinction is made between bibliographical indexes for new 
and past literature based on the willingness of the user to trade 
perfection for currency. Indexes giving keywords in their context are 
proposed as suitable for disseminating new information . These can be 
entirely machine-generated and hence kept up to date with the current 
literature. A compatible coding scheme to identify the indexed documents 
is also proposed. In it elements are automatically extracted from the 
usual identifiers of the document so that the coded identifier yields 
a maximum of information while remaining susceptible to normal methods of 
ordering. (First published 1959). 
Subfile: C 

Descriptors: indexing 

Class Codes: C7240 (Information analysis and indexing) 



27/5/19 (Item 2 from file: 94) 

DIALOG (R) File 94 : JICST-EPlus 

(c)2005 Japan Science and Tech Corp(JST). All rts. reserv. 

03592839 JICST ACCESSION NUMBER: 98A0311571 FILE SEGMENT: JICST-E 
Car number recognition technology for vehicle identification. From "kilo" 
to "minute", traffic jam information. 

KANEYAMA KEN J I (1) 
( 1 ) Omron Corp . 

Gazo Rabo, 1998, VOL. 9, NO. 3, PAGE. 18-22, FIG. 6, REF . 6 

JOURNAL NUMBER: L2340AAI ISSN NO: 0915-6755 

UNIVERSAL DECIMAL CLASSIFICATION: 681.3:165 656.1.05 

LANGUAGE: Japanese COUNTRY OF PUBLICATION: Japan 

DOCUMENT TYPE: Journal 

ARTICLE TYPE: Commentary 

MEDIA TYPE: Printed Publication 

ABSTRACT: The method if recognizing a license number is the most effective 
system for identifying a car sine a license plate substantially assures 
uniqueness. This paper introduces the recent trend by focusing on 
concrete examples applied to license number recognition technique and 
the vehicle control field. This paper describes the issues and the 1 
future trends of license number recognition technique, recognition 
algorith, examples applied to license number information , and 
license number recognition . 

DESCRIPTORS: traffic control; character recognition; numerical character; 
imaging; image processing; edge detection; feature extraction ; 
discriminant function; real time processing; automobile; automotive 
fitting 

BROADER DESCRIPTORS: traffic management; management; control; figure 
pattern recognition; pattern recognition; recognition; letter; 
information processing; treatment; detection; extraction ; separation; 
function (mathematics) ; mapping (mathematics) 

CLASSIFICATION CODE (S) : JE07000S; TB01032S 



34/5/8 (Item 8 from file: 8) 

DIALOG (R) File 8:Ei Compendex (R) 

(c) 2005 Elsevier Eng. Info. Inc. All rts . reserv. 

02095485 E.I. Monthly No: EIM8606-035415 

Title: MATCHING STRING PATTERNS IN LARGE TEXTUAL FILES. 

Author: Berkovich, Simon Y . ; Hegazy, Abd El Fatah A. 
Corporate Source: George Washington Univ, Washington, DC, USA 
Conference Title: International Symposium on New Directions in Computing. 
Conference Location: Trondheim, Norw Conference Date: 19850812 
Sponsor: IEEE Computer Soc, Los Alamitos, CA, USA; Norwegian Inst of 
Technology, Trondheim, Norw; Kongsberg Vaepenf abrikk, Norw 
E.I. Conference No.: 07877 

Source: Publ by IEEE, New York, NY, USA. Available from IEEE Service Cent 
(Cat n 85CH2134-5), Piscataway, NJ, USA p 122-127 
Publication Year: 1985 
ISBN: 0-8186-0639-8 
Language: English 

Document Type: PA; (Conference Paper) 
Journal Announcement: 8606 

Abstract: The authors present a general approach that could be efficient 
when searching large textual files for near-matching of a set of patterns. 
The approach is based on a mapping of string segments into key- number 
values. To apply the terms of query against text strings in a single 
pass simultaneously, the input set of patterns is arranged in a hash table. 
The tolerance property of hash collisions and pattern representation by 
segment extraction can be used to detect different classes of string 
variations. 12 refs. 

Descriptors: * DATABASE SYSTEMS; DATA PROCESSING- -File Organization 

Identifiers: STRING PATTERN MATCHING; LARGE TEXTUAL FILES; DATABASE 
SEARCHING; HASH TABLES 

Classification Codes: 

723 (Computer Software) 

72 (COMPUTERS 6c DATA PROCESSING) 



34/5/15 (Item 1 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2005 Institution of Electrical Engineers. All rts . reserv. 

6787568 INSPEC Abstract Number: C2001-01-6180N-017 
Title: Probability-based Chinese text processing and retrieval 

Author(s): Hiangji Huang; Robertson, S.; Cercone, N . ; An, A. 
Author Affiliation: Dept. of Inf. Sci . , City Univ., London, UK 
Journal: Computational Intelligence vol.16, no. 4 p. 552-69 
Publisher: Blackwell Publishers, 

Publication Date: Nov. 2000 Country of Publication: USA 

CODEN: COMIE6 ISSN: 0824-7935 

SICI : 0824-7 93 5 (200011) 16 : 4L. 552 : PBCT; 1-7 

Material Identity Number: P953 -2000- 004 

Language: English Document Type: Journal Paper (JP) 

Treatment: Theoretical (T) 

Abstract: We discuss the use of probability-based natural language 
processing for Chinese text retrieval . We focus on comparing different 
text extraction methods and probabilistic weighting methods. Several 

document processing methods and probabilistic weighting functions are 
presented. A number of experiments have been conducted on large standard 
text collections. We present the experimental results that compare a 
word-based text processing method with a character-based method. The 
experimental results also compare a number of term-weighting functions 
including both single-unit weighting and compound-unit weighting functions. 
(15 Refs) 

Subfile: C 

Descriptors: information retrieval; natural languages; text analysis 

Identifiers: Chinese text processing; natural language processing; 
Chinese text retrieval; text extraction ; probabilistic weighting 

Class Codes: C6180N (Natural language processing); C4210L (Formal 
languages and computational linguistics) ; C7250 (Information storage and 
retrieval ) 

Copyright 2000, I EE 
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File 350:Derwent WPIX 1963 -2005/UD, UM &UP=200535 
(c) 2005 Thomson Derwent 



15/5/6 (Item 6 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2005 Thomson Derwent . All rts . reserv. 

012642683 **Image available** 
WPI Acc No: 1999-448788/199938 
XRPX Acc No: N99-335274 

Data file request processing system for client server network - 

selects specific starting method among several registered methods, in 

response to data file request 
Patent Assignee: FUJI XEROX CO LTD (XERF ) 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 11184813 A 19990709 JP 97352612 A 19971222 199938 B 

Priority Applications (No Type Date) : JP 97352612 A 19971222 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 11184813 A 34 G06F-015/00 

Abstract (Basic) : JP 11184813 A 

NOVELTY - A file name analyzer (202) extracts method identifier 
and group identifier , from qualified name. A client management table 
registers identifier of one or more clients, corresponding to group 
identifier of client. A starting method is selected among several 
registered methods, in response to data file request. DETAILED 
DESCRIPTION - A specific execution method is identified within one or 
more methods to be started in the server. The server extracts the 
data file name within the request received from client. 
USE - For client-server network. 

ADVANTAGE - The information relating to data updation, is 
delivered to each client appropriately even under heavy sharing 
condition. DESCRIPTION OF DRAWING (S) - The figure shows block diagram 
of data communication system. (202) Analyzer. 
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Search information display method in hypermedia system - involves 

searching data outputs and extracting structure identifier and 

displaying search result to client 
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Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 11149479 A 7 G06F-017/30 

JP 2965018 B2 7 G06F-017/30 Previous Publ . patent JP 11149479 

Abstract (Basic) : JP 11149479 A 

NOVELTY - A structure name corresponds to each structure 
identifier and for each attribute registered mutual relationship 
between node and link of each structure identifier is shown to 
hierarchical structure. The search result is obtained by the 
directory server searching data outputs and the structure 
identifier is extracted and is displayed to client. DETAILED 
DESCRIPTION - An INDEPENDENT CLAIM is also included for search 
information display apparatus. 

USE - For displaying search inf ormationO in hypermedia system. 

ADVANTAGE - As search result is obtained by the server searching 
data outputs and extracts structure identifier and required 
information can be selected easily. DESCRIPTION OF DRAWING (S) - The 
figure shows the block diagram of search media of a directory server. 
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Patent Details: 
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Div ex patent EP 601860 
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Abstract (Basic) : EP 918283 A2 

NOVELTY - The server involves data which is included in a job 
request, and an output unit selecting portion. The output selecting 
portion includes a capability storage unit for storing a capability at 
each output unit in the system; a capability identifier extracting 
unit for extracting an identifier specifying a capability of the 
selected output unit from the job request; and an optimal output unit 
detecting unit for detecting all optimal output units whose 
capabilities coincide with a capability specified by the 
capability-specifying- identifier when the output unit selecting 
portion selects the output units to be as the selected output unit. 

DETAILED DESCRIPTION - The server involves a holding unit for 
holding output unit information that represents correspondence 
between the output units and the output information ; a job request 
receiving portion for receiving from one of the clients a job request 
containing output information and an information extracting 
portion for extracting the output information from the job request; 
an output unit selecting unit for selecting one of the output units 
in accordance with the extracted output information ; and data 
output portion for sending data to the selected output unit. 

USE - For connecting clients and output units e.g. printer, plotter 
or facsimile connected to the server in a system, with clients issuing 
job requests containing output information . 

ADVANTAGE - Can easily select adequate output unit for job 
request, even when client has no prior knowledge of any output unit 
connected to that server. Ensures rational use of output units by 
judging their current status and selecting the most appropriate one. 

DESCRIPTION OF DRAWING (S) - The drawing shows a diagram to 
illustrate the server. 
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Information classification judging method in information providing 
system connected to network - involves assigning temporary information 
classification to be true classification, if information classification 
with specific pattern is not obtained from information file 

Patent Assignee: NIPPON TELEGRAPH & TELEPHONE CORP (NITE ) 

Number of Countries: 001 Number of Patents: 002 
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Patent No Kind Date Applicat No Kind Date Week 
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Patent Details: 
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Abstract (Basic) : JP 10312326 A 

The method involves extracting a fixed portion from an input 
identifier . A temporary information classification is obtained from 
an information file (17) based on the extracted fixed portion using 
an information classification correspondence unit (13) . 

A contents identification unit (14) identifies whether a specific 
pattern is provided in a predetermined area of the information file, 
which is extracted based on the identifier . A true information 
classification is obtained by executing the information file as a 
script using a script executing unit (15) . When the information 
classification with the specific pattern is not obtained, the temporary 
classification is assigned to be the true information classification 
of the information file. 

ADVANTAGE - Performs dynamic selection of information 
classification by script execution. Offers correct information 
classification. 
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Relational data base management method with document search 
function - involves extracting first and second record identifier 
from document number and attribute value of search request signal 
based on which document data from data base is searched 
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Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
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Priority Applications (No Type Date) : JP 96117311 A 19960513 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 9305622 A 15 G06F-017/30 

Abstract (Basic) : JP 9305622 A 

The method involves receiving data searching request signal 
which contain the document number in the keyword, from an input unit 
(1) . Based on the received request signal, the first record identifier 
for the data to be searched is extracted . The second record 
identifier from the attribute data included in the conditional 
expression of the searched request signal is extracted . 

Based on the extracted first and second record identifier , the 
document corresponding to the selected data record is extracted 
from a data base. 

ADVANTAGE - Improves efficiency of data searching process. 
Enables easy identification of record identifier from search 
request signal. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To extract an adequate part as a document element 
from a document image and give a proper identifier by extracting 

document elements from the document image, line by line. 



SOLUTION: When an area extracting means 17 extracts plural partial 
areas from the document image and a feature extracting means 18 

extracts features from the extracted partial areas, an area array means 
19 arrays the partial areas in order according to the features at the 

extraction positions. An area dividing means 20 divides the respective 
arrayed partial areas as line areas, line by line, and a matching 

decision means 21 assigns the divided line areas to document elements 
set in a document element dictionary 15 according to the matching of 
the features. Thus, line areas assigned repeatedly to plural document 

elements among the line areas which are assigned to document areas are 

selected by an element extracting means 22 according to the mutual 
position relation. 
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ABSTRACT 

PURPOSE: To simplify a program and to improve program generating efficiency 
by executing the different program by the same message identifier when a 
reception message with the message identifier is received from terminal 
equipment provided with a terminal address . 

CONSTITUTION: An execution program is selected by storing information 
for extracting the message identifier of the reception message 3 from 
the terminal equipment 1 in a message identifier extracting table 30, 
in addition generating the combination ' of the terminal address and the 
message identifier and the name of the execution program corresponding to 
it in an execution program determining table 40 and storing these two 
tables 30, 40. Accordingly, even in the case where the messages with the 
same message identifier are received from plural terminal equipment 1, 
the different programs corresponding to the respective terminal equipments 
1 can be executed. Thus, the program becomes simple, and the program 
generating efficiency is improved. 
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fuzzy queries for classifying blood vessels lesions and tumors by 
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Abstract (Basic) : WO 200005663 A2 

NOVELTY - A number of features is extracted from query , while 
each of the features is fragmented into feature fragments each of which 
is hashed into hashed feature fragments. The latter can be used in 
accessing a hash table for obtaining object identifiers from it that 
can be used for obtaining information from the database relevant to 
the query . 

DETAILED DESCRIPTION - The method involves. 

(a) selecting a first one of a number of home nodes; 

(b) extracting , by the selected home node, a number of features 
from a query by a user; 

(c) fragmenting, by the selected home node, each extracted 
feature of the number of extracted features into a number of query 
fragments ; 

(d) hashing, by the selected home node, each query fragment of 
the number of query fragments, the hashed query fragment having a 
first portion and a second portion; 

(e) transmitting, by the selected home node, each hashed query 
fragment of the number of query fragments to a respective one of the 



number of query nodes indicated by the first portion of each the 
hashed query fragment; 

(f) using, by the query node, the second portion of the 
respective hash query fragment to access data according to a local 
hash table located on the query node; and 

(g) returning, by each query node accessing data according to 
the respective hashed query fragment, a number of object identifiers 

corresponding to the accessed data to the selected home node. 
INDEPENDENT CLAIMS are included for: 

(a) a distributed computer database system having an information 
retrieval tool for handling queries from a user 

(b) an information retrieval apparatus for processing query 
for word based and non-word based retrieval of information from 
database 

(c) a computer program for processing query for word based and 
non-word based retrieval of information from a database 

(d) an information indexing system for indexing information for 
facilitating retrieval from database 

(e) a computer program for indexing information for facilitating 
retrieval from database 

USE - In distributed computer database. 

ADVANTAGE - Provides an information retrieval system that can 
retrieve information from a unified database of word and non-word 
based information , including documents , images and other forms of 
multimedia, using a single indexing system. Such information 
retrieval systems preferably may be highly scalable, versatile, robust 
and economical . 

DESCRIPTION OF DRAWING (S) - The drawing is a block diagram of an 
embodiment of the present invention, 
user computer (102) 
link (103) 
home nodes (106) 
local area network (108) 

query nodes (109) 
object nodes (110) 
external servers (111) 
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Message faxing method for directory services over internet 
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Abstract (Basic) : US 5944769 A 

NOVELTY - A map and an indicator on the map are displayed afte 
receiving information from server. Direction from starting location 
given as input is displayed on selected location. A facsimile icon 
using facsimile number is selected from database for faxing message 
to selected location. 

DETAILED DESCRIPTION - A business name and corresponding location 
stored in database satisfying user query is selected from business 
name which are received from server and displayed. The facsimile numbe 
is stored in database corresponding to the business names and 
locations. A business identifier which is business icon is stored in 
database corresponding to several business names and is displayed. An 
INDEPENDENT CLAIM is also included for the systems for communicating 
over a network. 

USE - For providing directory services over internet. 

ADVANTAGE - Since business directory and map database are 
integrated, the user has facility of searching business directory 
using map database with the radius feature to quantify the search 
and obtains directions from specified user location to a selected 
search result by single website access. 

DESCRIPTION OF DRAWING (S ) - The figure shows the flowchart for 
interactions between client and server for search and map 
generation . 
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ABSTRACT 

PROBLEM TO BE SOLVED: To automatically extract information , needed to 
solve a problem written in an electronic mail, from stored information 
and display it by selecting the electronic mail. 

SOLUTION: When the input of a test data display indication is received, 
test data on a measurement item predetermined corresponding to the 
problem contents that a selected problem communication mail includes are 
extracted from the test data group that the data identifier 
included in the mail identifier of the selected problem communication 
mail and then displayed. Further, a display of instance data is 
requested, and a problem communication mail and a problem countermeasure 
mail including the problem contents of the selected communication mail 
are extracted from stored problem communication mails and problem 
countermeasure mails and the displayed. 
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ABSTRACT 

PURPOSE: To retrieve an object just by referring to a storage means and 
to efficiently perform retrieval in a short time by extracting partial 
information for respective recording media, working it and storing it in 
the storage means. 

CONSTITUTION: A location information data base 22 stores information 
for indicating which DC-ROMs 10 and the respective objects are stored in. A 

retrieval part 33 retrieves a data base 21 for the retrieval based 
on a retrieval command and a retrieval formula inputted from an input 
device 15 and obtains the document ID number of the object for 

satisfying the conditions of the retrieval formula. Further, the location 

information data base 22 is referred to and which CD-ROM 10 the object 
for satisfying the conditions of the retrieval formula is stored in is 

retrieved . A selection part 34 selects the object based on the 

information for indicating which DC-ROM 10 the object is stored in along 
with the document ID number of the object from the retrieval part 
38 and instructs a reader 13 to read the information of the object. 
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DA894-0079 DAM 
O. W. Weber 

IBM* Tec hnical Disclosure Bulletin 
Resolving Ambiguous Parsing Rules 



Vol.37 No. 11 November 1994 



Disclosed is a system which ensures the appropriate mapping of ambiguous address book fields 
for use by a synchronization program. This is accomplished by defining system default rules to 
either accommodate known exceptions or to inform the user of potential error situations. 

Products such as ADS/2* synchronize one address book with another, such as synchro- 
nizing a ccMail** data base with a VM Callup data base. To prepare for the synchronization, the 
system administrator must define how the fields of each data base are going to be mapped to each 
other. Parse rules are made available to break a source field into multiple master fields. This 
allows mapping to be accomplished at a more granular level and allows users to move data from 
directories supporting different formats while maintaining the correct format in each environment. 
Examples of types of parse rules that may prove useful are: 

• Breaking a complete name into its component parts. 

• Breaking a phone number into area code, exchange, and number. 

Following is an example of how one might parse a telephone number into its component parts: 

Source field: PHONE: (817)555-1212: IMAREA! 

IMEXCHANGE! 
IMNUMBER! 

The following parse rule will fill the destination fields with the correct data: 

(!MAREA!) IMEXCHANGEMMNUMBER! 

However, a problem exists if there is a record in the data base that doesn't conform to this 
mapping. For example, phone numbers are sometimes entered without any punctuation, such as 
the parameters for making a call via a modem. In cases like these, the synchronization program is 
broken, either terminating or mapping the fields incorrectly unbenounced to the user. The dis- 
closed system ensures the appropriate mapping even in these ambiguous situations by defining 
system default rules to either accommodate known exceptions or to inform the user of potential 
error situations. 

Continuing with the above phone number example, the administrator can define the fol- 
lowing system default rules, based upon the length (number of non-blank characters contained in) 
the phone number field: 
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Resolving Ambiguous Parsing Rules - Continued 



Length Assumed Format Example 



>13 Message to user 
13 { ! MAREA ! ) ! HEXCHANGE ! - ! MNUMBER ! (8 17) 555- 1212 
12 ( ! MAREA ! ) ! HEXCHANGE ! IMNUMBER! (817)5551212 
11 ! MAREA! ! MEXCHANGE! -IMNUMBER! 817555-1212 
19 ! MAREA! ! HEXCHANGE! IMNUMBER! 8175551212 
9 Message to user 

8 ! MEXCHANGE ! - ! MNUMBER ! 555-1212 
7 ! MEXCHANGE! IMNUMBER! 5551212 
6 !MPREFIX!-!MNUMBER! 5-1212 
5 JMPREFIX! IMNUMBER! 51212 
4 IMNUMBER! 1212 
<4 Message to user 

Now if a non-standard entry is detected in a phone field, the system will map the field 
accor4Uig to the appropriate system default rule. For example, if '8175551212 is detected, the 
system removes any blank characters, determines the length of the field (10), automatically 
assumes that the rule to be used is "IMAREAHMEXCIIANGFJIMNUMBERI, and assumes that 
817 is the area code, 555 is the exchange, and 1212 is the number. 

With this method, the data integrity of the mapped fields is maintained. Of course, the 
phone number scenario is only one example, as is the keying off the length of the field. 

Also disclosed is the ability to automatically detect other non -conforming fields. For 
example, if the directory field, NAME, contains data in the following format: 

Smith, R. (John) 

the following is a typical parse rule that would correctly parse the name field: 
ILNAME!, !MI! (!FNAME!) 

The LNAME, MI, and FNAME are master fields. For the parse rule above, they would 
contain: 

LNAME = Smith 
FNAME = John 
Ml = R. 

However, a problem exists if there is a record in the data base that doesn't conform to any 
of the defined mappings. For example, if a record contains a NAME field as follows: 

Smith, J.R. (John) 

as some data bases do, then the middle initial incorrectly gqts mapped as : 
MI - J.R. 
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Resolving Ambiguous Parsing Rules - Continued 



The disclosed system detects that the "V in the fust middle initial matches the first letter of 
the first name 'John'. Then depending upon the prcconfigured profile, the system will take one 
of the following actions: 

• Automatically assume that the "V in the first middle initial is redundant with the first letter of 
the first name 'John*, and that this person's name is 'John R. Smith*. Fields will then be 
mapped accordingly. 

• Surface this exception to the administrator, who then decides whether this person's name is 
'John R. Smith* or 'John J. R. Smith*. Fields will then be mapped accordingly. 

Also disclosed is the automatic search of the data base, and subsequent warning to the 
administrator upon detection of non-conforming data. The administrator is then allowed to inter- 
vene to ensure an accurate mapping of the non-conforming field. This process can either be per- 
formed at mapping time or at synchronization time. 

♦ Trademark of IBM Corp. 
Trademark of Lotus Corp. 
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